SEQUENCE LISTING 




(1) GENERAL INFORMATION:

(i) APPLICANT: 

(A) NAME: Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V.

(B) STREET: none

(C) CITY: Berlin

(D) STATE: none

(E) COUNTRY: Germany

(F) POSTAL CODE (ZIP): none

(A) NAME: National Public Health Institute

(B) STREET: Mannerheimintie 166

(C) CITY: Helsinki

(D) STATE: none

(E) COUNTRY: Finland

(F) POSTAL CODE (ZIP): 00300

(ii) TITLE OF INVENTION: Novel nucleic acid molecule encoding a
(poly)peptide co-segregating in mutated form with
Autoimmune Polyendocrinopathy Candidiasis Ectodermal
Dystrophy (APECED)

(iii) NUMBER OF SEQUENCES: 32

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2245 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CGGGCGCACA GCCGGCGCGG AGGCCCCACA GCCCCGCCGG GACCCGAGGC &AAGCGAGGG 60

GCTGCCAGTG TCCCGGGACC CACCGCGTCC GCCCCAGCCC CGGGTCCCCG CGCCCACCCC 120

ATGGCGACGG ACGCGGCGCT ACGCCGGCTT CTGAGGCTGC ACCGCACGGA GATCGCGGTG 180

GCCGTGGACA GCGCCTTCCC ACTGCTGCAC GCGCTGGCTG ACCACGACGT GGTCCCCGAG 240

GACAAGTTTC AGGAGACGCT TCATCTGAAG GAAAAGGAGG GCTGCCCCCA GGCCTTCCAC 300

GCCCTCCTGT CCTGGCTGCT GACCCAGGAC TCCACAGCCA TCCTGGACTT CTGGAGGGTG 360

CTGTTCAAGG ACTACAACCT GGAGCGCTAT GGCCGGCTGC AGCCCATCCT GGACAGCTTC 420

CCCAAAGATG TGGACCTCAG CCAGCCCCGG AAGGGGAGGA AGCCCCCGGC CGTCCCCAAG 480

GCTTTGGTAC CGCCACCCAG ACTCCCCACC AAGAGGAAGG CCTCAGAAGA GGCTCGAGCT 540

GCCGCGCCAG CAGCCCTGAC TCCAAGGGGC ACCGCCAGCC CAGGCTCTCA ACTGAAGGCC 600

AAGCCCCCCA AGAAGCCGGA GAGCAGCGCA GAGCAGCAGC GCCTTCCACT CGGGAACGGG 660

ATTCAGACCA TGTCAGC^TC AGTCCAGAGA GCTGTGGCCA TGTCCTCCGG GGACGTCCCG 720

GGAGCCCGAG GGGCCGTGGA GGGGATCCTC ATCCAGCAGG TGTTTGAGTC AGGCGGCTCC 780

AAGAAGTGCA TCCAGGTTGG VgGGGAGTTC TACACTCCCA GCAAGTTCGA AGACTCCGGC 840

AGTGGGAAGA ACAAGGCCCG CAGCAGCAGT GGCCCGAAGC CTCTGGTTCG AGCCAAGGGA 900

GCCCAGGGCG CTGCCCCCGG TGgXgGTGAG GCTAGGCTGG GCCAGCAGGG CAGCGTTCCC 960

GCCCCTCTGG CCCTCCCCAG TGACCCCCAG CTCCACCAGA AGAATGAGGA CGAGTGTGCC 1020

GTGTGTCGGG ACGGCGGGGA GCTCATCTGC TGTGACGGCT GCCCTCGGGC CTTCCACCTG 1080

GCCTGCCTGT CCCCTCCGCT CCGGGAGATC CCCAGTGGGA CCTGGAGGTG CTCCAGCTGC 1140

CTGCAGGCAA CAGTCCAGGA GGTGCAGCCC CGGGCAGAGG AGCCCCGGCC CCAGGAGCCA 1200

CCCGTGGAGA CCCCGCTCCC CCCGGGGCTT AGGTCGGCGG GAGAGGAGGT AAGAGGTCCA 1260

CCTGGGGAAC CCCTAGCCGG CATGGACACG ACTCTTGTCT ACAAGCACCT GCCGGCTCCG 1320

CCTTCTGCAG CCCCGCTGCC AGGGCTGGAC TCCTCGGCCC TGCACCCCCT ACTGTGTGTG 1380

GGTCCTGAGG GTCAGCAGAA CCTGGCTCCT GGTGCGCGTT GCGGGGTGTG CGGAGATGGT 1440

ACGGACGTGC TGCGGTGTAC TCACTGCGCC GCTGCCTTCC ACTGGCGCTG CCACTTCCCA 1500

GCCGGCACCT CCCGGCCCGG GACGGGCCTG CGCTGCAGAT CCTGCTCAGG AGACGTGACC 1560

CCAGCCCCTG TGGAGGGGGT GCTGGCCCCC AGCCCCGCCC GTCTGGCCCC TGGGCCTGCC 1620

AAGGATGACA CTGCCAGTCA CGAGCCCGCT CTGCACAGGG ATGACCTGGA GTCCCTTCTG 1680

AGCGAGCACA CCTTCGATGG CATCCTGCAG TGGGCCATCC AGAGCATGGC CCGTCCGGCG 1740

GCCCCCTTCC CCTCCTGACC CCAGATGGCC GGGACATGCA GCTCTGA^GA GAGAGTGCTG 1800

AGAAGGACAC CTCCTTCCTC AGTCCTGGAA GCCGGCCGGC TGGGATCAAG AAGGGGACAG 1860

CGCCACCTCT TGTCAGTGCT CGGCTGTAAA CAGCTCTGTG TTTCTGGGGA OACCAGCCAT 1920

CATGTGCCTG GAAATTAAAC CCTGCCCCAC TTCTCTACTC TGGAAGTCCC CGGGAGCCTC 1980

TCCTTGCCTG GTGACCTACT AAAAATATAA AAATTAGCTG GGTGTGGTGG TGGGTGCCTG 2040

TAATCCCAGC TACATGGGAG CCTGAGGCAT GAGAATCACT TGAACTCGGG AGGTGGAlSGT 2100

TGCAGTGAGC TGAGATTGCG CCACTGCACT CCAGTCTGGT CGGCAAGAGT GAGACTCCGT 2160

CTCAAAAACA AAACAAAAAA ACCACATAAC ATAAATTTAT CATCTCGACC ACTTTTCAGT 2220

TCAGTGGCAT TCACATCTCA TGTAA 2245



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 545 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:



Met Ala Thr Asp Ala Ala Leu Arg Arg Leu Leu Arg Leu His Arg Thr
1 5 10 15

Glu Ile Ala Val Ala Val Asp Ser Ala Phe Pro Leu Leu His Ala Leu
20 25 30

Ala Asp His Asp Val Val Pro Glu Asp Lys Phe Gln Glu Thr Leu His
35 40 45

Leu Lys Glu Lys Glu Gly Cys Pro Gln Ala Phe His Ala Leu Leu Ser
50 55 60

Trp Leu Leu Thr Gln Asp Ser Thr Ala Ile Leu Asp Phe Trp Arg Val
65 70 75 80

Leu Phe Lys Asp Tyr Asn Leu Glu Arg Tyr Gly Arg Leu Gln Pro Ile
85 90 95

Leu Asp Ser Phe Pro Lys Asp Val Asp Leu Ser Gln Pro Arg Lys Gly
100 105 110

Arg Lys Pro Pro Ala Val Pro Lys Ala Leu Val Pro Pro Pro Arg Leu
115 120 125

Pro Thr Lys Arg Lys Ala Ser Glu Glu Ala Arg Ala Ala Ala Pro Ala
130 135 140

Ala Leu Thr Pro Arg Gly Thr Ala Ser Pro Gly Ser Gln Leu Lys Ala
145 150 155 160

Lys Pro Pro Lys Lys Pro Glu Ser Ser Ala Glu Gln Gln Arg Leu Pro
165 170 175

Leu Gly Asn Gly Ile Gln Thr Met Ser Ala Ser Val Gln Arg Ala Val
180 185 190

Ala Met Ser Ser Gly Asp Val Pro Gly Ala Arg Gly Ala Val Glu Gly
195 200 205

Ile Leu Ile Gln Gln Val Phe Glu Ser Gly Gly Ser Lys Lys Cys Ile
210 215 220

Gln Val Gly Gly Glu Phe Tyr Thr Pro Ser Lys Phe Glu Asp Ser Gly
225 230 235 240

Ser Gly Lys Asn Lys Ala Arg Ser Ser Ser Gly Pro Lys Pro Leu Val
245 250 255

Arg Ala Lys Gly Ala Gln Gly Ala Ala Pro Gly Gly Gly Glu Ala Arg
260 265 270

Leu Gly Gln Gln Gly Ser Val Pro Ala Pro Leu Ala Leu Pro Ser Asp
275 280 285

Pro Gln Leu His Gln Lys Asn Glu Asp Glu Cys Ala Val Cys Arg Asp
290 295 300

Gly Gly Glu Leu Ile Cys Cys Asp Gly Cys Pro Arg Ala Phe His Leu
305 310 315 320

Ala Cys Leu Ser Pro Pro Leu Arg Glu Ile Pro Ser Gly Thr Trp Arg
325 330 335

Cys Ser Ser Cys Leu Gln Ala Thr Val Gln Glu Val Gln Pro Arg Ala
340 345 350

Glu Glu Pro Arg Pro Gln Glu Pro Pro Val Glu Thr Pro Leu Pro Pro
355 360 365

Gly Leu Arg Ser Ala Gly Glu Glu Val Arg Gly Pro Pro Gly Glu Pro
370 375 380

Leu Ala Gly Met Asp Thr Thr Leu Val Tyr Lys His Leu Pro Ala Pro
385 390 395 400

Pro Ser Ala Ala Pro Leu Pro Gly Leu Asp Ser Ser Ala Leu His Pro
405 410 415

Leu Leu Cys Val Gly Pro Glu Gly Gln Gln Asn Leu Ala Pro Gly Ala
420 425 430

Arg Cys Gly Val Cys Gly Asp Gly Thr Asp Val Leu Arg Cys Thr His
435 440 445

Cys Ala Ala Ala Phe His Trp Arg Cys His Phe Pro Ala Gly Thr Ser
450 455 460

Arg Pro Gly Thr Gly Leu Arg Cys Arg Ser Cys Ser Gly Asp Val Thr
465 470 475 480

Pro Ala Pro Val Glu Gly Val Leu Ala Pro Ser Pro Ala Arg Leu Ala
485 490 495

Pro Gly Pro Ala Lys Asp Asp Thr Ala Ser His Glu Pro Ala Leu His
500 505 510

Arg Asp Asp Leu Glu Ser Leu Leu Ser Glu His Thr Phe Asp Gly Ile
515 520 525

Leu Gln Trp Ala Ile Gln Ser Met Ala Arg Pro Ala Ala Pro Phe Pro
530 535 540

Ser
545



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
ACAGGCAGGC AG 12




(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
ACAGGCAGGC CA 12



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GCAGGCCAGG TG 12



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GCGGAGGGGA CA 12



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 90 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GTGTGGACTG TCACGGAAAC CCCCACGTGT C^ATGGAAAGT CCAAAATTCT ACAGGAGTCT 60
TTCTGTTGAT CTCCAGTCAG AGGCTGGGGG 90



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 90 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
AAGGGGCTGG TGTGGAAAGC CCCACGGCAT GGTGGAAAGT CCGAAATTCT ACAGGGGCCT 60
CTTTGTTAAA CCTCCATGCA AGAGGCTGGG 90



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 90 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

NNGNGGNNNG TNNNGNAANC CCCNNNGNNT GNTGGAAAGT CCNAAATTCT ACAGGNGNCT 60

NTNTGTTNAN CNNCNNTNNN AGNNNNNGGG 90

N represents any nucleotide or a gap



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1659 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: CD 
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

ATGGCAGGTG GGGATGGAAT GCTACGCCG^ CTGCTGAGGC TGCACCGCAC CGAGATCGCG 60

GTGGCCATAG ACAGTGCCTT TCCGCTGCTG CATGCTCTAG CCGACCACGA CGTGGTCCCT 120

GAGGACAAGT TCCAGGAGAC GCTCCGTCTG AA^GAGAAGG AAGGCTGCCC CCAGGCCTTC 180

CACGCCCTGC TGTCCTGGCT CCTGACCCGG GAC^pTGGGG CCATCCTGGA TTTCTGGAGG 240

ATTCTCTTTA AGGACTACAA TCTGGAGCGG TACAG^GCC TGCATAGCAT CCTGGACGGC 300

TTCCCAAAAG ATGTGGACCT AAACCAGTCC CGGAAAG&pA GAAAGCCCCT TGCTGGTCCC 360

AAGGCCGCGG TACTGCCACC CAGACCCCCC ACCAAGAGAA AAGCACTGGA GGAGCCTCGA 420

GCCACCCCAC CAGCAACTCT GGCCTCAAAG AGCGTCTCCA GCCCAGGCTC CCACCTGAAG 480

ACTAAGCCCC CTAAGAAGCC AGATGGCAAC TTGGAGTCAC AGCACCTTCC TCTTGGAAAC 540

GGAATTCAGA CCATGGCAGC TTCTGTCCAG AGAGCTGTGA CCGTQLGCCTC TGGGGATGTT 600

CCAGGAACCC GAGGGGCCGT GGAAGGGATC CTTATCCAGC AGGTGT'fcrGA GTCAGGAAGA 660

TCCAAGAAGT GCATTCAGGT TGGGGGAGAG TTTTATACAC CCAACAAGTT CGAAGACCCC 720

AGTGGCAATT TGAAGAACAA GGCCCGGAGT GGTAGCAGCC TAAAGCCAGT GGTCCGAGCC 780

AAGGGAGCCC AGGTCACTAT ACCTGGTAGA GATGAGCAGA AAGTGGGCCA GCAGTGTGGG 840

GTTCCTCCCC TTCCATCCCT CCCCAGTGAG CCCCAGGTTA ACCAGAAGAA CGAGGATGAG 900

TGTGCCGTGT GCCACGACGG AGGTGAGCTC ATCTGTTGTG ACGGCTGTCC CCGGGCCTTC 960

CACCTGGCTT GCCTGTCCCC ACCTCTGCAG GAGATCCCCA GTGGCCTCTG GAGATGCTCC 1020

TGCTGCCTCC AGGGCAGAGT CCAACAGAAC CTGTCCCAGC CTGAGGTGTC CAGGCCCCCG 1080

GAGCTACCTG CAGAGACCCC GATCCTCGTG GGACTGAGGT CAGCTTCAGA GAAAACCAGG 1140

GGCCCATCCA GGGAGCTCAA AGCCAGCTCT GATGCTGCTG TCACATATGT GAACCTGCTG 1200

GCCCCGCACC CTGCAGCTCC TCTGCTGGAG CCTTCAGCAC TGTGCCCTCT ACTGAGTGCT 1260

GGGAATGAGG GGCGGCCAGG TCCAGCACCA AGCGCGCGAT GCAGTGTGTG TGGCGATGGC 1320

ACCGAGGTGT TGCGGTGTGC ACACTGTGCC GCTGCCTTCC ACTGGCGCTG CCACTTCCCG 1380

ACGGCCGCCG CCCGGCCGGG GACCAATCTC CGCTGCAAAT CCTGCTCTGC AGACTCGACT 1440

CCCACGCCAG GCACACCGGG CGAAGCTGTA CCCACCTCTG GGCCCCGTCC AGCACCTGGG 1500

CTTGCCAAGG TAGGGGACGA CTCTGCTAGT CACGACCCTG TTCTACATAG GGACGACCTG 1560

GAGTCCCTCC TCAATGAGCA CTCATTTGAC GGCATCCTGC AGTGGGCCAT CCAGAGCATG 1620

TCACGCCCGC TGGCCGAGAC ACCACCCTTC TCTTCCTGA 1659



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Ala Gly Gly Asp Gly Met Leu Arg Arg Leu Leu Arg Leu His Arg
1 5 10 15

Thr Glu Ile Ala Val Ala Ile Asp Ser Ala Phe Pro Leu Leu His Ala
20 25 30

Leu Ala Asp His Asp Val Val Pro Glu Asp Lys Phe Gln Glu Thr Leu
35 40 45

Arg Leu Lys Glu Lys Glu Gly Cys Pro Gln Ala Phe His Ala Leu Leu
50 55 60

Ser Trp Leu Leu Thr Arg Asp Ser Gly Ala Ile Leu Asp Phe Trp Arg
65 70 75 80

Ile Leu Phe Lys Asp Tyr Asn Leu Glu Arg Tyr Ser Arg Leu His Ser
85 90 95

Ile Leu Asp Gly Phe Pro Lys Asp Val Asp Leu Asn Gln Ser Arg Lys
100 105 110

Gly Arg Lys Pro Leu Ala Gly Pro Lys Ala Ala Val Leu Pro Pro Arg
115 120 125

Pro Pro Thr Lys Arg Lys Ala Leu Glu Glu Pro Arg Ala Thr Pro Pro
130 135 140

Ala Thr Leu Ala Ser Lys Ser Val Ser Ser Pro Gly Ser His Leu Lys
145 150 155 160

Thr Lys Pro Pro Lys Lys Pro Asp Gly Asn Leu Glu Ser Gln His Leu
165 170 175

Pro Leu Gly Asn Gly Ile Gln Thr Met Ala Ala Ser Val Gln Arg Ala
180 185 190

Val Thr Val Ala Ser Gly Asp Val Pro Gly Thr Arg Gly Ala Val Glu
195 200 205

Gly Ile Leu Ile Gln Gln Val Phe Glu Ser Gly Arg Ser Lys Lys Cys
210 215 220

Ile Gln Val Gly Gly Glu Phe Tyr Thr Pro Asn Lys Phe Glu Asp Pro
225 230 235 240

Ser Gly Asn Leu Lys Asn Lys Ala Arg Ser Gly Ser Ser Leu Lys Pro
245 250 255

Val Val Arg Ala Lys Gly Ala Gln Val Thr Ile Pro Gly Arg Asp Glu
260 265 270

Gln Lys Val Gly Gln Gln Cys Gly Val Pro Pro Leu Pro Ser Leu Pro
275 280 285

Ser Glu Pro Gln Val Asn Gln Lys Asn Glu Asp Glu Cys Ala Val Cys
290 295 300

His Asp Gly Gly Glu Leu Ile Cys Cys Asp Gly Cys Pro Arg Ala Phe
305 310 315 320

His Leu Ala Cys Leu Ser Pro Pro Leu Gln Glu Ile Pro Ser Gly Leu
325 330 335

Trp Arg Cys Ser Cys Cys Leu Gln Gly Arg Val Gln Gln Asn Leu Ser
340 345 350

Gln Pro Glu Val Ser Arg Pro Pro Glu Leu Pro Ala Glu Thr Pro Ile
355 360 365

Leu Val Gly Leu Arg Ser Ala Ser Glu Lys Thr Arg Gly Pro Ser Arg
370 375 380

Glu Leu Lys Ala Ser Ser Asp Ala Ala Val Thr Tyr Val Asn Leu Leu
385 390 395 400

Ala Pro His Pro Ala Ala Pro Leu Leu Glu Pro Ser Ala Leu Cys Pro
405 410 415

Leu Leu Ser Ala Gly Asn Glu Gly Arg Pro Gly Pro Ala Pro Ser Ala
420 425 430

Arg Cys Ser Val Cys Gly Asp Gly Thr Glu Val Leu Arg Cys Ala His
435 440 445

Cys Ala Ala Ala Phe His Trp Arg Cys His Phe Pro Thr Ala Ala Ala
450 455 460

Arg Pro Gly Thr Asn Leu Arg Cys Lys Ser Cys Ser Ala Asp Ser Thr
465 470 475 480

Pro Thr Pro Gly Thr Pro Gly Glu Ala Val Pro Thr Ser Gly Pro Arg
485 490 495

Pro Ala Pro Gly Leu Ala Lys Val Gly Asp Asp Ser Ala Ser His Asp
500 505 510

Pro Val Leu His Arg Asp Asp Leu Glu Ser Leu Leu Asn Glu His Ser
515 520 525

Phe Asp Gly Ile Leu Gln Trp Ala Ile Gln Ser Met Ser Arg Pro Leu
530 535 540

Ala Glu Thr Pro Pro Phe Ser Ser
545 550



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 545 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Ala Thr Asp Ala Ala Leu Arg Arg Leu Leu Arg Leu His Arg Thr
1 5 10 15

Glu Ile Ala Val Ala Val Asp Ser Ala Phe Pro Leu Leu His Ala Leu
20 25 30

Ala Asp His Asp Val Val Pro Glu Asp Lys Phe Gln Glu Thr Leu His
35 40 45

Leu Lys Glu Lys Glu Gly Cys Pro Gln Ala Phe His Ala Leu Leu Ser
50 55 60

Trp Leu Leu Thr Gln Asp Ser Thr Ala Ile Leu Asp Phe Trp Arg Val
65 70 75 80

Leu Phe Lys Asp Tyr Asn Leu Glu Arg Tyr Gly Arg Leu Gln Pro Ile
85 90 95

Leu Asp Ser Phe Pro Lys Asp Val Asp Leu Ser Gln Pro Arg Lys Gly
100 105 110

Arg Lys Pro Pro Ala Val Pro Lys Ala Leu Val Pro Pro Pro Arg Leu
115 120 125

Pro Thr Lys Arg Lys Ala Ser Glu Glu Ala Arg Ala Ala Ala Pro Ala
130 135 140

Ala Leu Thr Pro Arg Gly Thr Ala Ser Pro Gly Ser Gln Leu Lys Ala
145 150 155 160

Lys Pro Pro Lys Lys Pro Glu Ser Ser Ala Glu Gln Gln Arg Leu Pro
165 170 175

Leu Gly Asn Gly Ile Gln Thr Met Ser Ala Ser Val Gln Arg Ala Val
180 185 190

Ala Met Ser Ser Gly Asp Val Pro Gly Ala Arg Gly Ala Val Glu Gly
195 200 205

Ile Leu Ile Gln Gln Val Phe Glu Ser Gly Gly Ser Lys Lys Cys Ile
210 215 220

Gln Val Gly Gly Glu Phe Tyr Thr Pro Ser Lys Phe Glu Asp Ser Gly
225 230 235 240

Ser Gly Lys Asn Lys Ala Arg Ser Ser Ser Gly Pro Lys Pro Leu Val
245 250 255

Arg Ala Lys Gly Ala Gln Gly Ala Ala Pro Gly Gly Gly Glu Ala Arg
260 265 270

Leu Gly Gln Gln Gly Ser Val Pro Ala Pro Leu Ala Leu Pro Ser Asp
275 280 285

Pro Gln Leu His Gln Lys Asn Glu Asp Glu Cys Ala Val Cys Arg Asp
290 295 300

Gly Gly Glu Leu Ile Cys Cys Asp Gly Cys Pro Arg Ala Phe His Leu
305 310 315 320

Ala Cys Leu Ser Pro Pro Leu Arg Glu Ile Pro Ser Gly Thr Trp Arg
325 330 335

Cys Ser Ser Cys Leu Gln Ala Thr Val Gln Glu Val Gln Pro Arg Ala
340 345 350

Glu Glu Pro Arg Pro Gln Glu Pro Pro Val Glu Thr Pro Leu Pro Pro
355 360 365

Gly Leu Arg Ser Ala Gly Glu Glu Val Arg Gly Pro Pro Gly Glu Pro
370 375 380

Leu Ala Gly Met Asp Thr Thr Leu Val Tyr Lys His Leu Pro Ala Pro
385 390 395 400

Pro Ser Ala Ala Pro Leu Pro Gly Leu Asp Ser Ser Ala Leu His Pro
405 410 415

Leu Leu Cys Val Gly Pro Glu Gly Gln Gln Asn Leu Ala Pro Gly Ala
420 425 430

Arg Cys Gly Val Cys Gly Asp Gly Thr Asp Val Leu Arg Cys Thr His
435 440 445

Cys Ala Ala Ala Phe His Trp Arg Cys His Phe Pro Ala Gly Thr Ser
450 455 460

Arg Pro Gly Thr Gly Leu Arg Cys Arg Ser Cys Ser Gly Asp Val Thr
465 470 475 480

Pro Ala Pro Val Glu Gly Val Leu Ala Pro Ser Pro Ala Arg Leu Ala
485 490 495

Pro Gly Pro Ala Lys Asp Asp Thr Ala Ser His Glu Pro Ala Leu His
500 505 510

Arg Asp Asp Leu Glu Ser Leu Leu Ser Glu His Thr Phe Asp Gly Ile
515 520 525

Leu Gln Trp Ala Ile Gln Ser Met Ala Arg Pro Ala Ala Pro Phe Pro
530 535 540

Ser
545



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 552 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Met Ala Gly Gly Asp Gly Met Leu Arg Arg Leu Leu Arg Leu His Arg
1 5 10 15

Thr Glu Ile Ala Val Ala Ile Asp Ser Ala Phe Pro Leu Leu His Ala
20 25 30

Leu Ala Asp His Asp Val Val Pro Glu Asp Lys Phe Gln Glu Thr Leu
35 40 45

Arg Leu Lys Glu Lys Glu Gly Cys Pro Gln Ala Phe His Ala Leu Leu
50 55 60

Ser Trp Leu Leu Thr Arg Asp Ser Gly Ala Ile Leu Asp Phe Trp Arg
65 70 75 80

Ile Leu Phe Lys Asp Tyr Asn Leu Glu Arg Tyr Ser Arg Leu His Ser
85 90 95

Ile Leu Asp Gly Phe Pro Lys Asp Val Asp Leu Asn Gln Ser Arg Lys
100 105 110

Gly Arg Lys Pro Leu Ala Gly Pro Lys Ala Ala Val Leu Pro Pro Arg
115 120 125

Pro Pro Thr Lys Arg Lys Ala Leu Glu Glu Pro Arg Ala Thr Pro Pro
130 135 140

Ala Thr Leu Ala Ser Lys Ser Val Ser Ser Pro Gly Ser His Leu Lys
145 150 155 160

Thr Lys Pro Pro Lys Lys Pro Asp Gly Asn Leu Glu Ser Gln His Leu
165 170 175

Pro Leu Gly Asn Gly Ile Gln Thr Met Ala Ala Ser Val Gln Arg Ala
180 185 190

Val Thr Val Ala Ser Gly Asp Val Pro Gly Thr Arg Gly Ala Val Glu
195 200 205

Gly Ile Leu Ile Gln Gln Val Phe Glu Ser Gly Arg Ser Lys Lys Cys
210 215 220

Ile Gln Val Gly Gly Glu Phe Tyr Thr Pro Asn Lys Phe Glu Asp Pro
225 230 235 240

Ser Gly Asn Leu Lys Asn Lys Ala Arg Ser Gly Ser Ser Leu Lys Pro
245 250 255

Val Val Arg Ala Lys Gly Ala Gln Val Thr Ile Pro Gly Arg Asp Glu
260 265 270

Gln Lys Val Gly Gln Gln Cys Gly Val Pro Pro Leu Pro Ser Leu Pro
275 280 285

Ser Glu Pro Gln Val Asn Gln Lys Asn Glu Asp Glu Cys Ala Val Cys
290 295 300

His Asp Gly Gly Glu Leu Ile Cys Cys Asp Gly Cys Pro Arg Ala Phe
305 310 315 320

His Leu Ala Cys Leu Ser Pro Pro Leu Gln Glu Ile Pro Ser Gly Leu
325 330 335

Trp Arg Cys Ser Cys Cys Leu Gln Gly Arg Val Gln Gln Asn Leu Ser
340 345 350

Gln Pro Glu Val Ser Arg Pro Pro Glu Leu Pro Ala Glu Thr Pro Ile
355 360 365

Leu Val Gly Leu Arg Ser Ala Ser Glu Lys Thr Arg Gly Pro Ser Arg
370 375 380

Glu Leu Lys Ala Ser Ser Asp Ala Ala Val Thr Tyr Val Asn Leu Leu
385 390 395 400

Ala Pro His Pro Ala Ala Pro Leu Leu Glu Pro Ser Ala Leu Cys Pro
405 410 415

Leu Leu Ser Ala Gly Asn Glu Gly Arg Pro Gly Pro Ala Pro Ser Ala
420 425 430

Arg Cys Ser Val Cys Gly Asp Gly Thr Glu Val Leu Arg Cys Ala His
435 440 445

Cys Ala Ala Ala Phe His Trp Arg Cys His Phe Pro Thr Ala Ala Ala
450 455 460

Arg Pro Gly Thr Asn Leu Arg Cys Lys Ser Cys Ser Ala Asp Ser Thr
465 470 475 480

Pro Thr Pro Gly Thr Pro Gly Glu Ala Val Pro Thr Ser Gly Pro Arg
485 490 495

Pro Ala Pro Gly Leu Ala Lys Val Gly Asp Asp Ser Ala Ser His Asp
500 505 510

Pro Val Leu His Arg Asp Asp Leu Glu Ser Leu Leu Asn Glu His Ser
515 520 525

Phe Asp Gly Ile Leu Gln Trp Ala Ile Gln Ser Met Ser Arg Pro Leu
530 535 540

Ala Glu Thr Pro Pro Phe Ser Ser
545 550



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 550 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:




Xaa Xaa Xaa Asp Xaa Xaa Leu Arg Arg Leu Leu Arg Leu His Arg Thr
1 5 10 15

Glu Ile Ala Val Ala Xaa Asp Ser Ala Phe Pro Leu Leu His Ala Leu
20 25 30

Ala Asp His Asp Val Val Pro Glu Asp Lys Phe Gln Glu Thr Leu Xaa
35 40 45

Leu Lys Glu Lys Glu Gly Cys Pro Gln Ala Phe His Ala Leu Leu Ser
50 55 60

Trp Leu Leu Thr Xaa Asp Ser Xaa Ala Ile Leu Asp Phe Trp Arg Xaa
65 70 75 80

Leu Phe Lys Asp Tyr Asn Leu Glu Arg Tyr Xaa Arg Leu Xaa Xaa Ile
85 90 95

Leu Asp Xaa Phe Pro Lys Asp Val Asp Leu Xaa Gln Xaa Arg Lys Gly
100 105 110

Arg Lys Pro Xaa Ala Xaa Pro Lys Ala Xaa Val Xaa Pro Pro Arg Xaa
115 120 125

Pro Thr Lys Arg Lys Ala Xaa Glu Glu Xaa Arg Ala Xaa Xaa Pro Ala
130 135 140

Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Ser Pro Gly Ser Xaa Leu Lys Xaa
145 150 155 160

Lys Pro Pro Lys Lys Pro Xaa Xaa Xaa Xaa Glu Xaa Gln Xaa Leu Pro
165 170 175

Leu Gly Asn Gly Ile Gln Thr Met Xaa Ala Ser Val Gln Arg Ala Val
180 185 190

Xaa Xaa Xaa Ser Gly Asp Val Pro Gly Xaa Arg Gly Ala Val Glu Gly
195 200 205

Ile Leu Ile Gln Gln Val Phe Glu Ser Gly Xaa Ser Lys Lys Cys Ile
210 215 220

Gln Val Gly Gly Glu Phe Tyr Thr Pro Xaa Lys Phe Glu Asp Xaa Ser
225 230 235 240

Gly Xaa Xaa Lys Asn Lys Ala Arg Ser Xaa Ser Xaa Xaa Lys Pro Xaa
245 250 255

Val Arg Ala Lys Gly Ala Gln Xaa Xaa Xaa Pro Gly Xaa Xaa Glu Xaa
260 265 270

Xaa Xaa Gly Gln Gln Xaa Xaa Val Pro Xaa Xaa Xaa Xaa Leu Pro Ser
275 280 285

Xaa Pro Gln Xaa Xaa Gln Lys Asn Glu Asp Glu Cys Ala Val Cys Xaa
290 295 300

Asp Gly Gly Glu Leu Ile Cys Cys Asp Gly Cys Pro Arg Ala Phe His
305 310 315 320

Leu Ala Cys Leu Ser Pro Pro Leu Xaa Glu Ile Pro Ser Gly Xaa Trp
325 330 335

Arg Cys Ser Xaa Cys Leu Gln Xaa Xaa Val Gln Xaa Xaa Xaa Xaa Xaa
340 345 350

Xaa Glu Xaa Xaa Arg Pro Xaa Glu Xaa Pro Xaa Glu Thr Pro Xaa Xaa
355 360 365

Xaa Gly Leu Arg Ser Ala Xaa Glu Xaa Xaa Arg Gly Pro Xaa Xaa Glu
370 375 380

Xaa Xaa Ala Xaa Xaa Asp Xaa Xaa Xaa Xaa Tyr Xaa Xaa Leu Xaa Ala
385 390 395 400

Pro Xaa Xaa Ala Ala Pro Leu Xaa Xaa Leu Xaa Xaa Ser Ala Leu Xaa
405 410 415

Pro Leu Leu Xaa Xaa Gly Xaa Glu Gly Xaa Xaa Xaa Xaa Ala Pro Xaa
420 425 430

Ala Arg Cys Xaa Val Cys Gly Asp Gly Thr Xaa Val Leu Arg Cys Xaa
435 440 445

His Cys Ala Ala Ala Phe His Trp Arg Cys His Phe Pro Xaa Xaa Xaa
450 455 460

Xaa Arg Pro Gly Thr Xaa Leu Arg Cys Xaa Ser Cys Ser Xaa Asp Xaa
465 470 475 480

Thr Pro Xaa Pro Xaa Xaa Xaa Gly Xaa Xaa Xaa Pro Xaa Ser Xaa Xaa
485 490 495

Arg Xaa Ala Pro Gly Xaa Ala Lys Xaa Xaa Asp Asp Xaa Ala Ser His
500 505 510

Xaa Pro Xaa Leu His Arg Asp Asp Leu Glu Ser Leu Leu Xaa Glu His
515 520 525

Xaa Phe Asp Gly Ile Leu Gln Trp Ala Ile Gln Ser Met Xaa Arg Pro
530 535 540

Xaa Ala Xaa Xaa Pro Xaa
545 550

Xaa denotes any amino acid 

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: YES
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
GGGGCCTCGA TGGACGTCTC TGGGGCCCAG GTOBTGGTTC GCGCGCTA 48



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Pro Glu Leu Pro Ala Glu Thr Pro Gly Pro Ala Pro Ser Ala Arg
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
AGTGAGCCCC AGGTTAACCA GAACGAGGAT GAGTGTGCCG TGT 43




(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Ser Glu Pro Gln Val Asn Gln Asn Glu Asp Glu Cys Ala Val
1 5 10



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: YES
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GTCACCAGGC TCGGTTCCCT CGGGTCCCAT CTCTACTCO



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
Val Val Arg Ala Lys Gly Ala Gln Gly Arg Asp Glu Gln Lys Val



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

AGAAGTGCAT CCAGGTTGGC



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GGAAGAGGGG CGTCAGCAAT



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Met Ala Thr Asp Ala Ala Leu Arg Arg Leu Leu Arg Leu His Arg
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Ser Gln Pro Arg Lys Gly Arg Lys Pro Pro Ala Val Pro Lys
1 5 10



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Gly Gly Cys Thr Thr Cys Thr Gly Ala Gly Gly Cys Thr Gly Cys Ala
1 5 10 15

Cys Cys



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

GCTCTGGATG GCCTACTGC



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

AGAAGTGCAT CCAGGTTGGC



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

GTGTGCTCGC TCAGAAGGG

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

TGGCAGGTGG GGATGGAA

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

GGAGGGATGG AAGGGGAGGA



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

TCCCACCTGA AGACTAAGC



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide"

(iii) HYPOTHETICAL: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

TCACAGCTCT CTGGACAGAA